The Garp Architecture and C Compiler

نویسندگان

  • Timothy J. Callahan
  • John R. Hauser
  • John Wawrzynek
چکیده

62 Computer The Garp Architecture and C Compiler V arious projects and products have been built using off-the-shelf field-programma-ble gate arrays (FPGAs) as compute accelerators for specific tasks. Such systems typically connect one or more FPGAs to the host computer via an I/O bus. Some have shown remarkable speedups, albeit limited to specific application domains. Many factors limit the general usefulness of such systems. Long reconfiguration times prevent acceleration of applications that spread their time over many different tasks. Low-bandwidth paths for data transfer limit the usefulness of such systems to tasks that have a high compute-to-memory-bandwidth ratio. In addition, standard FPGA tools require hardware design expertise beyond the knowledge of most programmers. To address the bandwidth problems, some developers have proposed integrating specially designed rapidly reconfigurable hardware more closely with the processor. To help investigate this idea we designed our own architecture in detail, called Garp, 1 and experimented with running applications on it. We are also investigating whether Garp's design enables automatic, fast, effective compilation across a broad range of applications. Our results so far have been promising. Garp combines a single-issue MIPS processor core with reconfigurable hardware to be used as an accelerator. We designed both the reconfigurable hardware and the interfaces among the system components, tailoring them for general-purpose computing. Garp is designed to fit into an ordinary processing environment that includes structured programs, subroutine libraries, context switches, virtual memory, and multiple users. The Garp chip does not exist as real silicon. We have, however, completed critical parts of the integrated circuit layout and performed circuit simulation to give us good estimates of Garp's clock speed, power consumption, and silicon area for a sample implementation. We designed Garp with the intent that its reconfig-urable hardware would accelerate loops of general-purpose programs. This goal led to the following decisions about Garp: • We decided that a few cycles of overhead for transferring data between processor registers and the reconfigurable hardware would be acceptable, since this overhead would occur only at the entrance and exit of loops. • The reconfigurable hardware needed its own direct path to the processor's memory system, since most nontrivial loops operate over memory data structures. Relying on the main processor to shuffle data between the reconfigurable hardware and memory would be unacceptable; the processor would act as a bandwidth bottleneck and also add cycles of latency to every access. • The reconfigurable hardware …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Garp: a MIPS processor with a reconfigurable coprocessor

Typical reconfigurable machines exhibit shortcomings that make them less than ideal for general-purposecomputing. The Garp Architecture combines reconfigurable hardware with a standard MIPS processor on the same die to retain the better features of both. Novel aspects of the architecture are presented, as well as a prototype software environment and preliminary performance results. Compared to ...

متن کامل

Performance Evaluation of GARP Model in Forecasting the Probability of Gully Erosion Occurrence in Abgendi Basin, Kohgiluye & Boyerahmad Province

The present study has evaluated the efficiency of GARP model in forecasting the probability of gully erosion occurrence in Abgendi basin, Kohgiluye & Boyerahmad province. After field visits, 81 gullies were recorded by GPS and surveying camera, then the map of gully erosion distribution was prepared in ArcGIS environment. Variables of elevation, slope, slope direction, distance to road, distanc...

متن کامل

The Caenorhabditis elegans GARP complex contains the conserved Vps51 subunit and is required to maintain lysosomal morphology

In yeast the Golgi-associated retrograde protein (GARP) complex is required for tethering of endosome-derived transport vesicles to the late Golgi. It consists of four subunits--Vps51p, Vps52p, Vps53p, and Vps54p--and shares similarities with other multimeric tethering complexes, such as the conserved oligomeric Golgi (COG) and the exocyst complex. Here we report the functional characterization...

متن کامل

A Novel Multiply-Accumulator Unit Bus Encoding Architecture for Image Processing Applications

In the CMOS circuit power dissipation is a major concern for VLSI functional units. With shrinking feature size, increased frequency and power dissipation on the data bus have become the most important factor compared to other parts of the functional units. One of the most important functional units in any processor is the Multiply-Accumulator unit (MAC). The current work focuses on the develop...

متن کامل

Dual roles of the mammalian GARP complex in tethering and SNARE complex assembly at the trans-golgi network.

Tethering factors and SNAREs control the last two steps of vesicular trafficking: the initial interaction and the fusion, respectively, of transport vesicles with target membranes. The Golgi-associated retrograde protein (GARP) complex regulates retrograde transport from endosomes to the trans-Golgi network (TGN). Although GARP has been proposed to function as a tethering factor at the TGN, dir...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Computer

دوره 33  شماره 

صفحات  -

تاریخ انتشار 2000